Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 31925 |
| Missing cells | 26101 |
| Missing cells (%) | 4.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 4.6 MiB |
| Average record size in memory | 152.0 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 5 |
Alerts
| Alert | Type |
|---|---|
| aids_diagnoses is highly overall correlated with aids_diagnosis_rate and 5 other fields | High correlation |
| aids_diagnosis_rate is highly overall correlated with aids_diagnoses and 6 other fields | High correlation |
| borough is highly overall correlated with uhf | High correlation |
| concurrent_diagnoses is highly overall correlated with aids_diagnoses and 4 other fields | High correlation |
| death_rate is highly overall correlated with aids_diagnoses and 3 other fields | High correlation |
| deaths is highly overall correlated with aids_diagnoses and 7 other fields | High correlation |
| gender is highly overall correlated with aids_diagnosis_rate | High correlation |
| hiv_diagnoses is highly overall correlated with aids_diagnoses and 4 other fields | High correlation |
| hiv_diagnosis_rate is highly overall correlated with aids_diagnoses and 3 other fields | High correlation |
| hiv_related_death_rate is highly overall correlated with death_rate and 2 other fields | High correlation |
| non_hiv_related_death_rate is highly overall correlated with death_rate and 2 other fields | High correlation |
| percent_linked_to_care_within_3_months is highly overall correlated with percent_viral_suppression and 1 other field | High correlation |
| percent_viral_suppression is highly overall correlated with percent_linked_to_care_within_3_months | High correlation |
| plwdhi_prevalence is highly overall correlated with aids_diagnosis_rate and 1 other field | High correlation |
| uhf is highly overall correlated with borough | High correlation |
| year is highly overall correlated with percent_linked_to_care_within_3_months | High correlation |
| hiv_diagnoses has 416 (1.3%) missing values | Missing |
| hiv_diagnosis_rate has 416 (1.3%) missing values | Missing |
| percent_linked_to_care_within_3_months has 13274 (41.6%) missing values | Missing |
| aids_diagnoses has 337 (1.1%) missing values | Missing |
| aids_diagnosis_rate has 337 (1.1%) missing values | Missing |
| plwdhi_prevalence has 3553 (11.1%) missing values | Missing |
| percent_viral_suppression has 1913 (6.0%) missing values | Missing |
| death_rate has 1913 (6.0%) missing values | Missing |
| hiv_related_death_rate has 1913 (6.0%) missing values | Missing |
| non_hiv_related_death_rate has 1913 (6.0%) missing values | Missing |
| hiv_diagnoses is highly skewed (γ1 = 23.92960598) | Skewed |
| hiv_diagnosis_rate is highly skewed (γ1 = 79.22042465) | Skewed |
| concurrent_diagnoses is highly skewed (γ1 = 23.91290325) | Skewed |
| aids_diagnoses is highly skewed (γ1 = 176.2162638) | Skewed |
| aids_diagnosis_rate is highly skewed (γ1 = 72.48822354) | Skewed |
| plwdhi_prevalence is highly skewed (γ1 = 38.60594909) | Skewed |
| deaths is highly skewed (γ1 = 125.5441877) | Skewed |
| death_rate is highly skewed (γ1 = 20.35510758) | Skewed |
| hiv_diagnoses has 14716 (46.1%) zeros | Zeros |
| hiv_diagnosis_rate has 14716 (46.1%) zeros | Zeros |
| concurrent_diagnoses has 21904 (68.6%) zeros | Zeros |
| percent_linked_to_care_within_3_months has 1114 (3.5%) zeros | Zeros |
| aids_diagnoses has 16621 (52.1%) zeros | Zeros |
| aids_diagnosis_rate has 16621 (52.1%) zeros | Zeros |
| plwdhi_prevalence has 3302 (10.3%) zeros | Zeros |
| deaths has 17384 (54.5%) zeros | Zeros |
| death_rate has 18051 (56.5%) zeros | Zeros |
| hiv_related_death_rate has 22734 (71.2%) zeros | Zeros |
| non_hiv_related_death_rate has 20497 (64.2%) zeros | Zeros |
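The headline "Missing cells" figure can be cross-checked against the per-column counts. The sketch below sums the counts from the Missing alerts, plus the 116 missing values that the concurrent_diagnoses section reports (too few to raise an alert). It assumes every other column has no missing values, which the matching total appears to support. The variable names are illustrative only.

```python
# Per-column missing counts copied from this report.
# Assumption: columns not listed here have 0 missing values.
missing_by_column = {
    "hiv_diagnoses": 416,
    "hiv_diagnosis_rate": 416,
    "concurrent_diagnoses": 116,
    "percent_linked_to_care_within_3_months": 13274,
    "aids_diagnoses": 337,
    "aids_diagnosis_rate": 337,
    "plwdhi_prevalence": 3553,
    "percent_viral_suppression": 1913,
    "death_rate": 1913,
    "hiv_related_death_rate": 1913,
    "non_hiv_related_death_rate": 1913,
}
n_rows, n_columns = 31925, 18

total_missing = sum(missing_by_column.values())
# Share of all cells (rows x columns) that are missing
missing_pct = 100 * total_missing / (n_rows * n_columns)

print(total_missing, f"{missing_pct:.1f}%")  # 26101 4.5%
```

Both numbers agree with the dataset statistics, so the overall 4.5% is dominated by percent_linked_to_care_within_3_months, which alone contributes about half of the missing cells.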
Reproduction
| Analysis started | 2024-05-08 08:42:35.177323 |
|---|---|
| Analysis finished | 2024-05-08 08:42:49.601427 |
| Duration | 14.42 seconds |
| Software version | ydata-profiling v4.6.4 |
| Download configuration | config.json |
year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2017.8714 |
| Minimum | 2011 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 2011 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2017 |
| median | 2018 |
| Q3 | 2020 |
| 95-th percentile | 2021 |
| Maximum | 2021 |
| Range | 10 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.7382534 |
|---|---|
| Coefficient of variation (CV) | 0.0013570009 |
| Kurtosis | 0.20240171 |
| Mean | 2017.8714 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.0023266 |
| Sum | 64420545 |
| Variance | 7.4980318 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2017 | 5184 | 16.2% |
| 2018 | 5184 | 16.2% |
| 2019 | 5184 | 16.2% |
| 2020 | 5184 | 16.2% |
| 2021 | 5184 | 16.2% |
| 2011 | 1201 | 3.8% |
| 2012 | 1201 | 3.8% |
| 2013 | 1201 | 3.8% |
| 2014 | 1201 | 3.8% |
| 2015 | 1201 | 3.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2011 | 1201 | 3.8% |
| 2012 | 1201 | 3.8% |
| 2013 | 1201 | 3.8% |
| 2014 | 1201 | 3.8% |
| 2015 | 1201 | 3.8% |
| 2017 | 5184 | 16.2% |
| 2018 | 5184 | 16.2% |
| 2019 | 5184 | 16.2% |
| 2020 | 5184 | 16.2% |
| 2021 | 5184 | 16.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2021 | 5184 | 16.2% |
| 2020 | 5184 | 16.2% |
| 2019 | 5184 | 16.2% |
| 2018 | 5184 | 16.2% |
| 2017 | 5184 | 16.2% |
| 2015 | 1201 | 3.8% |
| 2014 | 1201 | 3.8% |
| 2013 | 1201 | 3.8% |
| 2012 | 1201 | 3.8% |
| 2011 | 1201 | 3.8% |
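The descriptive statistics for year can be reproduced from its frequency table alone. The sketch below rebuilds the sum, mean, variance, standard deviation and coefficient of variation from the ten (value, count) pairs. It assumes the sample variance (divisor n - 1), which is the convention that reproduces the reported figure; names such as `year_counts` are illustrative. Note that 2016 does not occur in the data.

```python
import math

# Frequency table for `year`, copied from the report (2016 is absent).
year_counts = {y: 1201 for y in range(2011, 2016)}
year_counts.update({y: 5184 for y in range(2017, 2022)})

n = sum(year_counts.values())                       # number of observations
total = sum(y * c for y, c in year_counts.items())  # "Sum" in the report
mean = total / n

# Sample variance (divisor n - 1): an assumption that matches the reported value
variance = sum(c * (y - mean) ** 2 for y, c in year_counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

# Share of rows contributed by a single year from 2017 onwards
share_2017 = 100 * year_counts[2017] / n

print(n, total)                      # 31925 64420545
print(round(mean, 4), round(variance, 4), round(share_2017, 1))
```

Each year from 2017 onwards contributes about 16.2% of the rows and each year up to 2015 about 3.8%, which accounts for the negative skewness and the median of 2018.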
borough
Categorical
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 498.8 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.6867659 |
| Min length | 3 |
Characters and Unicode
| Total characters | 245400 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All |
|---|---|
| 2nd row | All |
| 3rd row | All |
| 4th row | All |
| 5th row | All |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Brooklyn | 7980 | 25.0% |
| Manhattan | 7315 | 22.9% |
| Queens | 7315 | 22.9% |
| Bronx | 5320 | 16.7% |
| Staten Island | 3325 | 10.4% |
| All | 670 | 2.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| brooklyn | 7980 | 22.6% |
| manhattan | 7315 | 20.8% |
| queens | 7315 | 20.8% |
| bronx | 5320 | 15.1% |
| staten | 3325 | 9.4% |
| island | 3325 | 9.4% |
| all | 670 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 41895 | |
| a | 28595 | |
| t | 21280 | 8.7% |
| o | 21280 | 8.7% |
| e | 17955 | 7.3% |
| r | 13300 | 5.4% |
| B | 13300 | 5.4% |
| l | 12645 | 5.2% |
| s | 10640 | 4.3% |
| y | 7980 | 3.3% |
| Other values (11) | 56530 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 206825 | |
| Uppercase Letter | 35250 | 14.4% |
| Space Separator | 3325 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 41895 | |
| a | 28595 | |
| t | 21280 | |
| o | 21280 | |
| e | 17955 | |
| r | 13300 | 6.4% |
| l | 12645 | 6.1% |
| s | 10640 | 5.1% |
| y | 7980 | 3.9% |
| k | 7980 | 3.9% |
| Other values (4) | 23275 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 13300 | |
| M | 7315 | |
| Q | 7315 | |
| S | 3325 | 9.4% |
| I | 3325 | 9.4% |
| A | 670 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 3325 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 242075 | |
| Common | 3325 | 1.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 41895 | |
| a | 28595 | |
| t | 21280 | 8.8% |
| o | 21280 | 8.8% |
| e | 17955 | 7.4% |
| r | 13300 | 5.5% |
| B | 13300 | 5.5% |
| l | 12645 | 5.2% |
| s | 10640 | 4.4% |
| y | 7980 | 3.3% |
| Other values (10) | 53205 |
Common
| Value | Count | Frequency (%) |
| 3325 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 245400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 41895 | |
| a | 28595 | |
| t | 21280 | 8.7% |
| o | 21280 | 8.7% |
| e | 17955 | 7.3% |
| r | 13300 | 5.4% |
| B | 13300 | 5.4% |
| l | 12645 | 5.2% |
| s | 10640 | 4.3% |
| y | 7980 | 3.3% |
| Other values (11) | 56530 |
uhf
Categorical
HIGH CORRELATION 
| Distinct | 43 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 498.8 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 27 |
| Mean length | 17.518559 |
| Min length | 3 |
Characters and Unicode
| Total characters | 559280 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All |
|---|---|
| 2nd row | All |
| 3rd row | All |
| 4th row | All |
| 5th row | All |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| All | 3995 | 12.5% |
| Canarsie - Flatlands | 665 | 2.1% |
| Fordham - Bronx Park | 665 | 2.1% |
| High Bridge - Morrisania | 665 | 2.1% |
| Hunts Point - Mott Haven | 665 | 2.1% |
| Kingsbridge - Riverdale | 665 | 2.1% |
| Northeast Bronx | 665 | 2.1% |
| Pelham - Throgs Neck | 665 | 2.1% |
| Bedford Stuyvesant - Crown Heights | 665 | 2.1% |
| Bensonhurst - Bay Ridge | 665 | 2.1% |
| Other values (33) | 21945 | 68.7% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| - | 17290 | 18.2% |
| all | 3995 | 4.2% |
| park | 3325 | 3.5% |
| east | 3325 | 3.5% |
| heights | 2660 | 2.8% |
| side | 1995 | 2.1% |
| queens | 1995 | 2.1% |
| harlem | 1330 | 1.4% |
| bronx | 1330 | 1.4% |
| upper | 1330 | 1.4% |
| Other values (79) | 56525 | 59.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 63175 | 11.3% |
| e | 46550 | 8.3% |
| a | 36575 | 6.5% |
| o | 33250 | 5.9% |
| t | 32585 | 5.8% |
| r | 30590 | 5.5% |
| n | 29260 | 5.2% |
| s | 29260 | 5.2% |
| l | 27940 | 5.0% |
| i | 27265 | 4.9% |
| Other values (40) | 202830 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 400340 | |
| Uppercase Letter | 77810 | 13.9% |
| Space Separator | 63175 | 11.3% |
| Dash Punctuation | 17290 | 3.1% |
| Other Punctuation | 665 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 46550 | |
| a | 36575 | |
| o | 33250 | 8.3% |
| t | 32585 | 8.1% |
| r | 30590 | 7.6% |
| n | 29260 | 7.3% |
| s | 29260 | 7.3% |
| l | 27940 | 7.0% |
| i | 27265 | 6.8% |
| h | 18620 | 4.7% |
| Other values (14) | 88445 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 9310 | |
| B | 7315 | 9.4% |
| H | 7315 | 9.4% |
| C | 5985 | 7.7% |
| P | 5320 | 6.8% |
| A | 4660 | 6.0% |
| F | 4655 | 6.0% |
| M | 3990 | 5.1% |
| W | 3325 | 4.3% |
| E | 3325 | 4.3% |
| Other values (13) | 22610 |
Space Separator
| Value | Count | Frequency (%) |
| 63175 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17290 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 665 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 478150 | |
| Common | 81130 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 46550 | 9.7% |
| a | 36575 | 7.6% |
| o | 33250 | 7.0% |
| t | 32585 | 6.8% |
| r | 30590 | 6.4% |
| n | 29260 | 6.1% |
| s | 29260 | 6.1% |
| l | 27940 | 5.8% |
| i | 27265 | 5.7% |
| h | 18620 | 3.9% |
| Other values (37) | 166255 |
Common
| Value | Count | Frequency (%) |
| 63175 | ||
| - | 17290 | 21.3% |
| . | 665 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 559280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 63175 | 11.3% |
| e | 46550 | 8.3% |
| a | 36575 | 6.5% |
| o | 33250 | 5.9% |
| t | 32585 | 5.8% |
| r | 30590 | 5.5% |
| n | 29260 | 5.2% |
| s | 29260 | 5.2% |
| l | 27940 | 5.0% |
| i | 27265 | 4.9% |
| Other values (40) | 202830 |
gender
Categorical
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 498.8 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 3 |
| Mean length | 3.9033673 |
| Min length | 3 |
Characters and Unicode
| Total characters | 124615 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All |
|---|---|
| 2nd row | Male |
| 3rd row | Female |
| 4th row | Transgender |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| All | 8880 | 27.8% |
| Men | 8640 | 27.1% |
| Women | 8640 | 27.1% |
| Male | 2880 | 9.0% |
| Female | 2880 | 9.0% |
| Transgender | 5 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| all | 8880 | 27.8% |
| men | 8640 | 27.1% |
| women | 8640 | 27.1% |
| male | 2880 | 9.0% |
| female | 2880 | 9.0% |
| transgender | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 25930 | |
| l | 23520 | |
| n | 17290 | |
| M | 11520 | |
| m | 11520 | |
| A | 8880 | 7.1% |
| W | 8640 | 6.9% |
| o | 8640 | 6.9% |
| a | 5765 | 4.6% |
| F | 2880 | 2.3% |
| Other values (5) | 30 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 92690 | |
| Uppercase Letter | 31925 | 25.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 25930 | |
| l | 23520 | |
| n | 17290 | |
| m | 11520 | |
| o | 8640 | 9.3% |
| a | 5765 | 6.2% |
| r | 10 | < 0.1% |
| s | 5 | < 0.1% |
| g | 5 | < 0.1% |
| d | 5 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 11520 | |
| A | 8880 | |
| W | 8640 | |
| F | 2880 | 9.0% |
| T | 5 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 124615 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 25930 | |
| l | 23520 | |
| n | 17290 | |
| M | 11520 | |
| m | 11520 | |
| A | 8880 | 7.1% |
| W | 8640 | 6.9% |
| o | 8640 | 6.9% |
| a | 5765 | 4.6% |
| F | 2880 | 2.3% |
| Other values (5) | 30 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 124615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 25930 | |
| l | 23520 | |
| n | 17290 | |
| M | 11520 | |
| m | 11520 | |
| A | 8880 | 7.1% |
| W | 8640 | 6.9% |
| o | 8640 | 6.9% |
| a | 5765 | 4.6% |
| F | 2880 | 2.3% |
| Other values (5) | 30 | < 0.1% |
age
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 498.8 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 5.4657792 |
| Min length | 3 |
Characters and Unicode
| Total characters | 174495 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All |
|---|---|
| 2nd row | All |
| 3rd row | All |
| 4th row | All |
| 5th row | 13 - 19 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| All | 7445 | 23.3% |
| 30 - 39 | 4800 | 15.0% |
| 40 - 49 | 4800 | 15.0% |
| 50 - 59 | 4800 | 15.0% |
| 60+ | 4800 | 15.0% |
| 18 - 29 | 4320 | 13.5% |
| 13 - 19 | 480 | 1.5% |
| 20 - 29 | 480 | 1.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| - | 19680 | 27.6% |
| all | 7445 | 10.4% |
| 30 | 4800 | 6.7% |
| 39 | 4800 | 6.7% |
| 40 | 4800 | 6.7% |
| 49 | 4800 | 6.7% |
| 50 | 4800 | 6.7% |
| 59 | 4800 | 6.7% |
| 60 | 4800 | 6.7% |
| 29 | 4800 | 6.7% |
| Other values (4) | 5760 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 39360 | ||
| 0 | 19680 | |
| - | 19680 | |
| 9 | 19680 | |
| l | 14890 | 8.5% |
| 3 | 10080 | 5.8% |
| 4 | 9600 | 5.5% |
| 5 | 9600 | 5.5% |
| A | 7445 | 4.3% |
| 1 | 5280 | 3.0% |
| Other values (4) | 19200 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 88320 | |
| Space Separator | 39360 | |
| Dash Punctuation | 19680 | 11.3% |
| Lowercase Letter | 14890 | 8.5% |
| Uppercase Letter | 7445 | 4.3% |
| Math Symbol | 4800 | 2.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 19680 | |
| 9 | 19680 | |
| 3 | 10080 | |
| 4 | 9600 | |
| 5 | 9600 | |
| 1 | 5280 | 6.0% |
| 2 | 5280 | 6.0% |
| 6 | 4800 | 5.4% |
| 8 | 4320 | 4.9% |
Space Separator
| Value | Count | Frequency (%) |
| 39360 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19680 |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 14890 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 7445 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4800 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 152160 | |
| Latin | 22335 | 12.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 39360 | ||
| 0 | 19680 | |
| - | 19680 | |
| 9 | 19680 | |
| 3 | 10080 | 6.6% |
| 4 | 9600 | 6.3% |
| 5 | 9600 | 6.3% |
| 1 | 5280 | 3.5% |
| 2 | 5280 | 3.5% |
| 6 | 4800 | 3.2% |
| Other values (2) | 9120 | 6.0% |
Latin
| Value | Count | Frequency (%) |
| l | 14890 | |
| A | 7445 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 174495 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 39360 | ||
| 0 | 19680 | |
| - | 19680 | |
| 9 | 19680 | |
| l | 14890 | 8.5% |
| 3 | 10080 | 5.8% |
| 4 | 9600 | 5.5% |
| 5 | 9600 | 5.5% |
| A | 7445 | 4.3% |
| 1 | 5280 | 3.0% |
| Other values (4) | 19200 |
race
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 498.8 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 15 |
| Mean length | 9.7658575 |
| Min length | 3 |
Characters and Unicode
| Total characters | 311775 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All |
|---|---|
| 2nd row | All |
| 3rd row | All |
| 4th row | All |
| 5th row | All |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| All | 7925 | 24.8% |
| Asian/Pacific Islander | 4800 | 15.0% |
| Black | 4800 | 15.0% |
| Other/Unknown | 4800 | 15.0% |
| White | 4800 | 15.0% |
| Latinx/Hispanic | 4320 | 13.5% |
| Latino/Hispanic | 480 | 1.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| all | 7925 | 21.6% |
| asian/pacific | 4800 | 13.1% |
| islander | 4800 | 13.1% |
| black | 4800 | 13.1% |
| other/unknown | 4800 | 13.1% |
| white | 4800 | 13.1% |
| latinx/hispanic | 4320 | 11.8% |
| latino/hispanic | 480 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 33600 | 10.8% |
| n | 33600 | 10.8% |
| a | 28800 | 9.2% |
| l | 25450 | 8.2% |
| c | 19200 | 6.2% |
| e | 14400 | 4.6% |
| s | 14400 | 4.6% |
| / | 14400 | 4.6% |
| t | 14400 | 4.6% |
| A | 12725 | 4.1% |
| Other values (18) | 100800 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 241450 | |
| Uppercase Letter | 51125 | 16.4% |
| Other Punctuation | 14400 | 4.6% |
| Space Separator | 4800 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 33600 | |
| n | 33600 | |
| a | 28800 | |
| l | 25450 | |
| c | 19200 | |
| e | 14400 | 6.0% |
| s | 14400 | 6.0% |
| t | 14400 | 6.0% |
| h | 9600 | 4.0% |
| k | 9600 | 4.0% |
| Other values (7) | 38400 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 12725 | |
| I | 4800 | 9.4% |
| B | 4800 | 9.4% |
| O | 4800 | 9.4% |
| P | 4800 | 9.4% |
| U | 4800 | 9.4% |
| W | 4800 | 9.4% |
| L | 4800 | 9.4% |
| H | 4800 | 9.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 14400 |
Space Separator
| Value | Count | Frequency (%) |
| 4800 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 292575 | |
| Common | 19200 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 33600 | 11.5% |
| n | 33600 | 11.5% |
| a | 28800 | 9.8% |
| l | 25450 | 8.7% |
| c | 19200 | 6.6% |
| e | 14400 | 4.9% |
| s | 14400 | 4.9% |
| t | 14400 | 4.9% |
| A | 12725 | 4.3% |
| h | 9600 | 3.3% |
| Other values (16) | 86400 |
Common
| Value | Count | Frequency (%) |
| / | 14400 | |
| 4800 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 311775 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 33600 | 10.8% |
| n | 33600 | 10.8% |
| a | 28800 | 9.2% |
| l | 25450 | 8.2% |
| c | 19200 | 6.2% |
| e | 14400 | 4.6% |
| s | 14400 | 4.6% |
| / | 14400 | 4.6% |
| t | 14400 | 4.6% |
| A | 12725 | 4.1% |
| Other values (18) | 100800 |
hiv_diagnoses
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 409 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 416 |
| Missing (%) | 1.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.936939 |
| Minimum | 0 |
|---|---|
| Maximum | 3379 |
| Zeros | 14716 |
| Zeros (%) | 46.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 38 |
| Maximum | 3379 |
| Range | 3379 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 68.89396 |
|---|---|
| Coefficient of variation (CV) | 6.2991996 |
| Kurtosis | 822.0801 |
| Mean | 10.936939 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 23.929606 |
| Sum | 344612 |
| Variance | 4746.3777 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 14716 | 46.1% |
| 1 | 4396 | 13.8% |
| 2 | 2307 | 7.2% |
| 3 | 1483 | 4.6% |
| 4 | 1073 | 3.4% |
| 5 | 823 | 2.6% |
| 6 | 670 | 2.1% |
| 7 | 510 | 1.6% |
| 8 | 443 | 1.4% |
| 9 | 350 | 1.1% |
| Other values (399) | 4738 | 14.8% |
| (Missing) | 416 | 1.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 14716 | 46.1% |
| 1 | 4396 | 13.8% |
| 2 | 2307 | 7.2% |
| 3 | 1483 | 4.6% |
| 4 | 1073 | 3.4% |
| 5 | 823 | 2.6% |
| 6 | 670 | 2.1% |
| 7 | 510 | 1.6% |
| 8 | 443 | 1.4% |
| 9 | 350 | 1.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3379 | 1 | < 0.1% |
| 3106 | 1 | < 0.1% |
| 2856 | 1 | < 0.1% |
| 2749 | 1 | < 0.1% |
| 2595 | 1 | < 0.1% |
| 2490 | 1 | < 0.1% |
| 2436 | 1 | < 0.1% |
| 2265 | 1 | < 0.1% |
| 2177 | 1 | < 0.1% |
| 2007 | 1 | < 0.1% |
hiv_diagnosis_rate
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 1963 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 416 |
| Missing (%) | 1.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.139789 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 14716 |
| Zeros (%) | 46.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4.5 |
| Q3 | 29.3 |
| 95-th percentile | 102.16 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 29.3 |
Descriptive statistics
| Standard deviation | 1260.1023 |
|---|---|
| Coefficient of variation (CV) | 32.194919 |
| Kurtosis | 6282.0851 |
| Mean | 39.139789 |
| Median Absolute Deviation (MAD) | 4.5 |
| Skewness | 79.220425 |
| Sum | 1233255.6 |
| Variance | 1587857.9 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 14716 | 46.1% |
| 5.2 | 61 | 0.2% |
| 11.2 | 57 | 0.2% |
| 8.7 | 54 | 0.2% |
| 6 | 51 | 0.2% |
| 5.5 | 51 | 0.2% |
| 8.8 | 51 | 0.2% |
| 4.9 | 51 | 0.2% |
| 8 | 50 | 0.2% |
| 8.5 | 50 | 0.2% |
| Other values (1953) | 16317 | 51.1% |
| (Missing) | 416 | 1.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 14716 | 46.1% |
| 0.2 | 1 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| 0.4 | 3 | < 0.1% |
| 0.5 | 4 | < 0.1% |
| 0.6 | 2 | < 0.1% |
| 0.7 | 3 | < 0.1% |
| 0.8 | 10 | < 0.1% |
| 0.9 | 20 | 0.1% |
| 1 | 12 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 5 | < 0.1% |
| 1221.5 | 1 | < 0.1% |
| 961.4 | 1 | < 0.1% |
| 916.7 | 1 | < 0.1% |
| 773.6 | 1 | < 0.1% |
| 742.4 | 1 | < 0.1% |
| 663.5 | 1 | < 0.1% |
| 635.3 | 1 | < 0.1% |
| 592.9 | 1 | < 0.1% |
| 572 | 1 | < 0.1% |
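The maximum of 99999 occurs five times and sits far above the next largest rate (1221.5), which suggests a placeholder code and not a measured rate. The report itself does not say so, so treat that reading as an assumption. Under that assumption, the sketch below uses only the reported sum and counts to estimate how much those five rows inflate the mean. The helper `mean_excluding_code` is hypothetical and not part of any library.

```python
def mean_excluding_code(total, n_valid, code, code_count):
    """Mean of the non-missing values after dropping `code_count` rows equal to `code`."""
    return (total - code * code_count) / (n_valid - code_count)

n_rows, n_missing = 31925, 416
reported_sum = 1233255.6      # "Sum" from the descriptive statistics
n_valid = n_rows - n_missing  # 31509 non-missing observations

# Mean over all non-missing rows; this reproduces the reported 39.139789
mean_all = reported_sum / n_valid

# Mean with the five 99999 rows removed (assumed to be placeholder codes)
mean_clean = mean_excluding_code(reported_sum, n_valid, code=99999, code_count=5)

print(round(mean_all, 2), round(mean_clean, 2))  # 39.14 23.28
```

If the assumption holds, five rows out of 31509 lift the mean from roughly 23.3 to 39.1, and they are also likely to account for much of the reported skewness and kurtosis.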
concurrent_diagnoses
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 157 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 116 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.066302 |
| Minimum | 0 |
|---|---|
| Maximum | 640 |
| Zeros | 21904 |
| Zeros (%) | 68.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 8 |
| Maximum | 640 |
| Range | 640 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 12.799644 |
|---|---|
| Coefficient of variation (CV) | 6.1944694 |
| Kurtosis | 832.6 |
| Mean | 2.066302 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.912903 |
| Sum | 65727 |
| Variance | 163.8309 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 21904 | 68.6% |
| 1 | 4098 | 12.8% |
| 2 | 1722 | 5.4% |
| 3 | 951 | 3.0% |
| 4 | 592 | 1.9% |
| 5 | 422 | 1.3% |
| 6 | 286 | 0.9% |
| 7 | 219 | 0.7% |
| 8 | 172 | 0.5% |
| 10 | 129 | 0.4% |
| Other values (147) | 1314 | 4.1% |
| (Missing) | 116 | 0.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 21904 | 68.6% |
| 1 | 4098 | 12.8% |
| 2 | 1722 | 5.4% |
| 3 | 951 | 3.0% |
| 4 | 592 | 1.9% |
| 5 | 422 | 1.3% |
| 6 | 286 | 0.9% |
| 7 | 219 | 0.7% |
| 8 | 172 | 0.5% |
| 9 | 115 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 640 | 1 | < 0.1% |
| 583 | 1 | < 0.1% |
| 564 | 1 | < 0.1% |
| 490 | 1 | < 0.1% |
| 480 | 1 | < 0.1% |
| 452 | 1 | < 0.1% |
| 443 | 1 | < 0.1% |
| 438 | 1 | < 0.1% |
| 369 | 1 | < 0.1% |
| 349 | 1 | < 0.1% |
percent_linked_to_care_within_3_months
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 125 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 13274 |
| Missing (%) | 41.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8178.2828 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 1114 |
| Zeros (%) | 3.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.81 |
| median | 1 |
| Q3 | 67 |
| 95-th percentile | 99999 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 66.19 |
Descriptive statistics
| Standard deviation | 27371.207 |
|---|---|
| Coefficient of variation (CV) | 3.346816 |
| Kurtosis | 7.345388 |
| Mean | 8178.2828 |
| Median Absolute Deviation (MAD) | 0.25 |
| Skewness | 3.0568908 |
| Sum | 1.5253315 × 10⁸ |
| Variance | 7.4918299 × 10⁸ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 5915 | 18.5% |
| 99999 | 1522 | 4.8% |
| 100 | 1145 | 3.6% |
| 0 | 1114 | 3.5% |
| 0.5 | 712 | 2.2% |
| 0.67 | 600 | 1.9% |
| 0.75 | 509 | 1.6% |
| 0.8 | 408 | 1.3% |
| 0.83 | 384 | 1.2% |
| 67 | 314 | 1.0% |
| Other values (115) | 6028 | 18.9% |
| (Missing) | 13274 | 41.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1114 | 3.5% |
| 0.14 | 1 | < 0.1% |
| 0.2 | 3 | < 0.1% |
| 0.25 | 19 | 0.1% |
| 0.29 | 2 | < 0.1% |
| 0.33 | 141 | 0.4% |
| 0.38 | 2 | < 0.1% |
| 0.4 | 19 | 0.1% |
| 0.41 | 1 | < 0.1% |
| 0.42 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 1522 | 4.8% |
| 100 | 1145 | 3.6% |
| 95 | 5 | < 0.1% |
| 94 | 9 | < 0.1% |
| 93 | 7 | < 0.1% |
| 92 | 17 | 0.1% |
| 91 | 13 | < 0.1% |
| 90 | 19 | 0.1% |
| 89 | 27 | 0.1% |
| 88 | 46 | 0.1% |
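The value tables for this column appear to mix two scales: fractions between 0 and 1 (0.5, 0.67, 0.83) and percentages up to 100 (67, 100), alongside 1522 occurrences of 99999. The sketch below tallies the ten most common values by apparent scale. The category boundaries and the reading of 99999 as a placeholder code are assumptions made for illustration, and `classify` is a hypothetical helper.

```python
SENTINEL = 99999  # assumed placeholder code; the report does not confirm this

def classify(value):
    """Label a value by the scale it appears to be recorded on."""
    if value == SENTINEL:
        return "placeholder"
    if value < 0 or value > 100:
        return "out of range"
    if value <= 1:
        # 0 and 1 are ambiguous: they are valid on either scale
        return "fraction (0-1)"
    return "percent (1-100]"

# The ten most common values and their counts, copied from the Common Values table.
common_values = {1: 5915, 99999: 1522, 100: 1145, 0: 1114, 0.5: 712,
                 0.67: 600, 0.75: 509, 0.8: 408, 0.83: 384, 67: 314}

tally = {}
for value, count in common_values.items():
    label = classify(value)
    tally[label] = tally.get(label, 0) + count

print(tally)
# {'fraction (0-1)': 9642, 'placeholder': 1522, 'percent (1-100]': 1459}
```

Because the two scales cannot be separated for the values 0 and 1, the mean (8178.2828) and the quantiles reported for this column are hard to interpret until the scale is harmonised at the source.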
aids_diagnoses
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 313 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 337 |
| Missing (%) | 1.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.9899329 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 16621 |
| Zeros (%) | 52.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3 |
| 95-th percentile | 25 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 564.22499 |
|---|---|
| Coefficient of variation (CV) | 56.479358 |
| Kurtosis | 31227.17 |
| Mean | 9.9899329 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 176.21626 |
| Sum | 315562 |
| Variance | 318349.84 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 16621 | 52.1% |
| 1 | 4397 | 13.8% |
| 2 | 2213 | 6.9% |
| 3 | 1437 | 4.5% |
| 4 | 959 | 3.0% |
| 5 | 770 | 2.4% |
| 6 | 592 | 1.9% |
| 7 | 450 | 1.4% |
| 8 | 351 | 1.1% |
| 9 | 333 | 1.0% |
| Other values (303) | 3465 | 10.9% |
| (Missing) | 337 | 1.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 16621 | 52.1% |
| 1 | 4397 | 13.8% |
| 2 | 2213 | 6.9% |
| 3 | 1437 | 4.5% |
| 4 | 959 | 3.0% |
| 5 | 770 | 2.4% |
| 6 | 592 | 1.9% |
| 7 | 450 | 1.4% |
| 8 | 351 | 1.1% |
| 9 | 333 | 1.0% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 1 | < 0.1% |
| 2366 | 1 | < 0.1% |
| 2106 | 1 | < 0.1% |
| 1949 | 1 | < 0.1% |
| 1712 | 1 | < 0.1% |
| 1529 | 1 | < 0.1% |
| 1518 | 1 | < 0.1% |
| 1440 | 1 | < 0.1% |
| 1307 | 1 | < 0.1% |
| 1087 | 1 | < 0.1% |
aids_diagnosis_rate
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 1525 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 337 |
| Missing (%) | 1.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.94949 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 16621 |
| Zeros (%) | 52.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 18.2 |
| 95-th percentile | 69.365 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 18.2 |
Descriptive statistics
| Standard deviation | 1378.2115 |
|---|---|
| Coefficient of variation (CV) | 40.595942 |
| Kurtosis | 5255.4117 |
| Mean | 33.94949 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 72.488224 |
| Sum | 1072396.5 |
| Variance | 1899467 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 16621 | 52.1% |
| 5.6 | 81 | 0.3% |
| 5.5 | 73 | 0.2% |
| 5.1 | 72 | 0.2% |
| 10.6 | 67 | 0.2% |
| 5.3 | 65 | 0.2% |
| 5.4 | 62 | 0.2% |
| 5.7 | 61 | 0.2% |
| 5.8 | 61 | 0.2% |
| 5.2 | 60 | 0.2% |
| Other values (1515) | 14365 | 45.0% |
| (Missing) | 337 | 1.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 16621 | 52.1% |
| 0.4 | 5 | < 0.1% |
| 0.5 | 9 | < 0.1% |
| 0.7 | 14 | < 0.1% |
| 0.8 | 24 | 0.1% |
| 0.9 | 17 | 0.1% |
| 1 | 13 | < 0.1% |
| 1.1 | 25 | 0.1% |
| 1.2 | 32 | 0.1% |
| 1.3 | 17 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 6 | < 0.1% |
| 588 | 1 | < 0.1% |
| 581.7 | 1 | < 0.1% |
| 539.8 | 1 | < 0.1% |
| 494.5 | 1 | < 0.1% |
| 418.5 | 1 | < 0.1% |
| 414.3 | 1 | < 0.1% |
| 403.5 | 1 | < 0.1% |
| 392.9 | 1 | < 0.1% |
| 386.8 | 1 | < 0.1% |
plwdhi_prevalence
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 169 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 3553 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.20455 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 3302 |
| Zeros (%) | 10.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.2 |
| median | 0.6 |
| Q3 | 1.6 |
| 95-th percentile | 4.6 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 1.4 |
Descriptive statistics
| Standard deviation | 2586.9283 |
|---|---|
| Coefficient of variation (CV) | 37.928969 |
| Kurtosis | 1488.5249 |
| Mean | 68.20455 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 38.605949 |
| Sum | 1935099.5 |
| Variance | 6692198 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.1 | 3336 | 10.4% |
| 0 | 3302 | 10.3% |
| 0.2 | 2480 | 7.8% |
| 0.3 | 1852 | 5.8% |
| 0.4 | 1585 | 5.0% |
| 0.5 | 1310 | 4.1% |
| 0.6 | 1113 | 3.5% |
| 0.7 | 945 | 3.0% |
| 0.8 | 821 | 2.6% |
| 0.9 | 709 | 2.2% |
| Other values (159) | 10919 | 34.2% |
| (Missing) | 3553 | 11.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3302 | 10.3% |
| 0.1 | 3336 | 10.4% |
| 0.2 | 2480 | 7.8% |
| 0.3 | 1852 | 5.8% |
| 0.4 | 1585 | 5.0% |
| 0.5 | 1310 | 4.1% |
| 0.6 | 1113 | 3.5% |
| 0.7 | 945 | 3.0% |
| 0.8 | 821 | 2.6% |
| 0.9 | 709 | 2.2% |
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 19 | 0.1% |
| 27.9 | 1 | < 0.1% |
| 26.6 | 1 | < 0.1% |
| 26.1 | 1 | < 0.1% |
| 23.4 | 1 | < 0.1% |
| 23.3 | 1 | < 0.1% |
| 22.6 | 1 | < 0.1% |
| 22.4 | 1 | < 0.1% |
| 20.8 | 1 | < 0.1% |
| 20.3 | 1 | < 0.1% |
percent_viral_suppression
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 170 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 1913 |
| Missing (%) | 6.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 532.08241 |
| Minimum | 0 |
| Maximum | 99999 |
| Zeros | 292 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.6 |
| Q1 | 0.8 |
| median | 0.9 |
| Q3 | 1 |
| 95-th percentile | 87 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0.2 |
Descriptive statistics
| Standard deviation | 7166.9219 |
|---|---|
| Coefficient of variation (CV) | 13.469571 |
| Kurtosis | 188.65555 |
| Mean | 532.08241 |
| Median Absolute Deviation (MAD) | 0.1 |
| Skewness | 13.807222 |
| Sum | 15968857 |
| Variance | 51364769 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 5193 | 16.3% |
| 0.8 | 831 | 2.6% |
| 0.86 | 819 | 2.6% |
| 0.88 | 796 | 2.5% |
| 0.89 | 781 | 2.4% |
| 0.83 | 758 | 2.4% |
| 0.9 | 703 | 2.2% |
| 0.87 | 702 | 2.2% |
| 0.81 | 677 | 2.1% |
| 0.85 | 667 | 2.1% |
| Other values (160) | 18085 | 56.6% |
| (Missing) | 1913 | 6.0% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 292 | 0.9% |
| 0.06 | 3 | < 0.1% |
| 0.08 | 2 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 2 | < 0.1% |
| 0.12 | 5 | < 0.1% |
| 0.13 | 3 | < 0.1% |
| 0.14 | 1 | < 0.1% |
| 0.16 | 2 | < 0.1% |
| 0.17 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 155 | 0.5% |
| 100 | 418 | 1.3% |
| 99 | 1 | < 0.1% |
| 98 | 3 | < 0.1% |
| 97 | 15 | < 0.1% |
| 96 | 20 | 0.1% |
| 95 | 35 | 0.1% |
| 94 | 38 | 0.1% |
| 93 | 59 | 0.2% |
| 92 | 95 | 0.3% |
deaths
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 374 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.97397 |
| Minimum | 0 |
| Maximum | 99999 |
| Zeros | 17384 |
| Zeros (%) | 54.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3 |
| 95-th percentile | 31 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 793.08115 |
|---|---|
| Coefficient of variation (CV) | 52.963986 |
| Kurtosis | 15825.747 |
| Mean | 14.97397 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 125.54419 |
| Sum | 478044 |
| Variance | 628977.71 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 17384 | 54.5% |
| 1 | 3937 | 12.3% |
| 2 | 1994 | 6.2% |
| 3 | 1247 | 3.9% |
| 4 | 873 | 2.7% |
| 5 | 702 | 2.2% |
| 6 | 520 | 1.6% |
| 7 | 461 | 1.4% |
| 8 | 362 | 1.1% |
| 9 | 335 | 1.0% |
| Other values (364) | 4110 | 12.9% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 17384 | 54.5% |
| 1 | 3937 | 12.3% |
| 2 | 1994 | 6.2% |
| 3 | 1247 | 3.9% |
| 4 | 873 | 2.7% |
| 5 | 702 | 2.2% |
| 6 | 520 | 1.6% |
| 7 | 461 | 1.4% |
| 8 | 362 | 1.1% |
| 9 | 335 | 1.0% |
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 2 | < 0.1% |
| 2040 | 1 | < 0.1% |
| 1906 | 1 | < 0.1% |
| 1898 | 1 | < 0.1% |
| 1824 | 1 | < 0.1% |
| 1751 | 1 | < 0.1% |
| 1678 | 1 | < 0.1% |
| 1645 | 1 | < 0.1% |
| 1423 | 1 | < 0.1% |
| 1375 | 1 | < 0.1% |
death_rate
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 757 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 1913 |
| Missing (%) | 6.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.3827736 |
| Minimum | 0 |
| Maximum | 1000 |
| Zeros | 18051 |
| Zeros (%) | 56.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 7.9 |
| 95-th percentile | 27.3 |
| Maximum | 1000 |
| Range | 1000 |
| Interquartile range (IQR) | 7.9 |
Descriptive statistics
| Standard deviation | 32.033529 |
|---|---|
| Coefficient of variation (CV) | 4.3389559 |
| Kurtosis | 551.30993 |
| Mean | 7.3827736 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.355108 |
| Sum | 221571.8 |
| Variance | 1026.147 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 18051 | 56.5% |
| 6.4 | 95 | 0.3% |
| 5.7 | 94 | 0.3% |
| 7.8 | 90 | 0.3% |
| 7.3 | 88 | 0.3% |
| 7.1 | 88 | 0.3% |
| 8.3 | 88 | 0.3% |
| 6.8 | 87 | 0.3% |
| 4.6 | 86 | 0.3% |
| 5.9 | 84 | 0.3% |
| Other values (747) | 11161 | 35.0% |
| (Missing) | 1913 | 6.0% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 18051 | 56.5% |
| 0.3 | 1 | < 0.1% |
| 0.6 | 3 | < 0.1% |
| 0.7 | 3 | < 0.1% |
| 0.8 | 8 | < 0.1% |
| 0.9 | 7 | < 0.1% |
| 1 | 11 | < 0.1% |
| 1.1 | 8 | < 0.1% |
| 1.2 | 14 | < 0.1% |
| 1.3 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 1000 | 16 | 0.1% |
| 817.1 | 1 | < 0.1% |
| 500 | 19 | 0.1% |
| 478.5 | 1 | < 0.1% |
| 394.8 | 1 | < 0.1% |
| 348.4 | 1 | < 0.1% |
| 333.3 | 17 | 0.1% |
| 314.5 | 1 | < 0.1% |
| 296.9 | 1 | < 0.1% |
| 284.6 | 1 | < 0.1% |
hiv_related_death_rate
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 422 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 1913 |
| Missing (%) | 6.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4003.3887 |
| Minimum | 0 |
| Maximum | 99999 |
| Zeros | 22734 |
| Zeros (%) | 71.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 24.6 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 19599.773 |
|---|---|
| Coefficient of variation (CV) | 4.8957955 |
| Kurtosis | 20.034381 |
| Mean | 4003.3887 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.6939361 |
| Sum | 1.201497 × 10^8 |
| Variance | 3.8415109 × 10^8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 22734 | 71.2% |
| 99999 | 1201 | 3.8% |
| 2.6 | 123 | 0.4% |
| 1.9 | 121 | 0.4% |
| 1.8 | 117 | 0.4% |
| 1.7 | 114 | 0.4% |
| 2.4 | 106 | 0.3% |
| 2.1 | 105 | 0.3% |
| 1.5 | 105 | 0.3% |
| 1.4 | 103 | 0.3% |
| Other values (412) | 5183 | 16.2% |
| (Missing) | 1913 | 6.0% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 22734 | 71.2% |
| 0.2 | 4 | < 0.1% |
| 0.3 | 8 | < 0.1% |
| 0.4 | 27 | 0.1% |
| 0.5 | 30 | 0.1% |
| 0.6 | 40 | 0.1% |
| 0.7 | 70 | 0.2% |
| 0.8 | 53 | 0.2% |
| 0.9 | 72 | 0.2% |
| 1 | 72 | 0.2% |
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 1201 | 3.8% |
| 1000 | 1 | < 0.1% |
| 500 | 6 | < 0.1% |
| 394.8 | 1 | < 0.1% |
| 333.3 | 2 | < 0.1% |
| 264 | 1 | < 0.1% |
| 250 | 2 | < 0.1% |
| 200 | 3 | < 0.1% |
| 172.2 | 5 | < 0.1% |
| 163 | 1 | < 0.1% |
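
The value 99999 occurs 1201 times in hiv_related_death_rate and dominates the reported mean. As a rough check, and assuming 99999 is a placeholder code rather than a measured rate (the report itself does not say so), the mean can be recomputed from the figures in the tables above with those rows excluded. The variable names below are illustrative, and the Sum is only known to 7 significant figures.

```python
# Figures copied from the report tables for hiv_related_death_rate.
n_rows = 31925        # number of observations in the dataset
n_missing = 1913      # missing values in this column
total = 1.201497e8    # reported Sum (rounded to 7 significant figures)
sentinel = 99999      # ASSUMPTION: placeholder code, not a real rate
n_sentinel = 1201     # rows holding the value 99999

# Mean as reported: sum over all non-missing rows.
n_valid = n_rows - n_missing
mean_reported = total / n_valid

# Mean after removing the rows that hold the assumed placeholder.
n_clean = n_valid - n_sentinel
mean_clean = (total - sentinel * n_sentinel) / n_clean

print(f"mean with 99999 rows:    {mean_reported:.1f}")
print(f"mean without 99999 rows: {mean_clean:.2f}")
```

Under that assumption the mean falls from about 4003 to below 2, which sits more comfortably with 71.2% of rows being zero and a 95th percentile of 24.6.
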
non_hiv_related_death_rate
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 589 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 1913 |
| Missing (%) | 6.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4005.7664 |
| Minimum | 0 |
| Maximum | 99999 |
| Zeros | 20497 |
| Zeros (%) | 64.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 498.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 4.2 |
| 95-th percentile | 48.145 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 4.2 |
Descriptive statistics
| Standard deviation | 19599.299 |
|---|---|
| Coefficient of variation (CV) | 4.8927714 |
| Kurtosis | 20.034325 |
| Mean | 4005.7664 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.6939268 |
| Sum | 1.2022106 × 10^8 |
| Variance | 3.8413253 × 10^8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 20497 | 64.2% |
| 99999 | 1201 | 3.8% |
| 4.9 | 96 | 0.3% |
| 3.8 | 93 | 0.3% |
| 3.9 | 92 | 0.3% |
| 3.4 | 88 | 0.3% |
| 2.9 | 86 | 0.3% |
| 5.3 | 85 | 0.3% |
| 4.4 | 83 | 0.3% |
| 5.5 | 83 | 0.3% |
| Other values (579) | 7608 | 23.8% |
| (Missing) | 1913 | 6.0% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 20497 | 64.2% |
| 0.3 | 2 | < 0.1% |
| 0.4 | 2 | < 0.1% |
| 0.5 | 6 | < 0.1% |
| 0.6 | 10 | < 0.1% |
| 0.7 | 12 | < 0.1% |
| 0.8 | 20 | 0.1% |
| 0.9 | 19 | 0.1% |
| 1 | 23 | 0.1% |
| 1.1 | 23 | 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 1201 | 3.8% |
| 1000 | 11 | < 0.1% |
| 500 | 9 | < 0.1% |
| 348.4 | 1 | < 0.1% |
| 333.3 | 11 | < 0.1% |
| 314.5 | 1 | < 0.1% |
| 250 | 6 | < 0.1% |
| 246.3 | 1 | < 0.1% |
| 238 | 1 | < 0.1% |
| 222.2 | 1 | < 0.1% |
| | age | aids_diagnoses | aids_diagnosis_rate | borough | concurrent_diagnoses | death_rate | deaths | gender | hiv_diagnoses | hiv_diagnosis_rate | hiv_related_death_rate | non_hiv_related_death_rate | percent_linked_to_care_within_3_months | percent_viral_suppression | plwdhi_prevalence | race | uhf | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.183 | 0.119 | 0.000 | 0.161 | 0.289 | 0.401 | 0.228 | 0.092 | -0.009 | 0.224 | 0.286 | 0.154 | 0.260 | 0.208 | 0.159 | 0.000 | -0.141 |
| aids_diagnoses | 0.183 | 1.000 | 0.909 | 0.001 | 0.822 | 0.546 | 0.716 | 0.013 | 0.824 | 0.657 | 0.494 | 0.470 | -0.138 | -0.049 | 0.481 | 0.000 | 0.015 | -0.170 |
| aids_diagnosis_rate | 0.119 | 0.909 | 1.000 | 0.077 | 0.683 | 0.462 | 0.583 | 0.913 | 0.685 | 0.666 | 0.385 | 0.377 | -0.095 | -0.099 | 0.562 | 0.014 | 0.000 | -0.146 |
| borough | 0.000 | 0.001 | 0.077 | 1.000 | -0.173 | -0.141 | -0.196 | 0.036 | -0.178 | -0.149 | -0.108 | -0.114 | 0.060 | 0.097 | -0.243 | 0.000 | 0.857 | 0.000 |
| concurrent_diagnoses | 0.161 | 0.822 | 0.683 | -0.173 | 1.000 | 0.448 | 0.613 | 0.034 | 0.790 | 0.626 | 0.446 | 0.405 | -0.069 | 0.004 | 0.370 | 0.034 | 0.049 | -0.161 |
| death_rate | 0.289 | 0.546 | 0.462 | -0.141 | 0.448 | 1.000 | 0.817 | 0.010 | 0.468 | 0.343 | 0.587 | 0.737 | 0.040 | 0.068 | 0.373 | 0.024 | 0.027 | -0.126 |
| deaths | 0.401 | 0.716 | 0.583 | -0.196 | 0.613 | 0.817 | 1.000 | 0.011 | 0.628 | 0.453 | 0.572 | 0.651 | -0.113 | 0.009 | 0.535 | 0.013 | 0.040 | -0.111 |
| gender | 0.228 | 0.013 | 0.913 | 0.036 | 0.034 | 0.010 | 0.011 | 1.000 | -0.235 | -0.203 | -0.149 | -0.150 | -0.131 | -0.142 | -0.163 | 0.212 | 0.000 | 0.166 |
| hiv_diagnoses | 0.092 | 0.824 | 0.685 | -0.178 | 0.790 | 0.468 | 0.628 | -0.235 | 1.000 | 0.888 | 0.455 | 0.422 | -0.230 | -0.061 | 0.411 | 0.036 | 0.050 | -0.200 |
| hiv_diagnosis_rate | -0.009 | 0.657 | 0.666 | -0.149 | 0.626 | 0.343 | 0.453 | -0.203 | 0.888 | 1.000 | 0.324 | 0.298 | -0.198 | -0.119 | 0.483 | 0.017 | 0.000 | -0.180 |
| hiv_related_death_rate | 0.224 | 0.494 | 0.385 | -0.108 | 0.446 | 0.587 | 0.572 | -0.149 | 0.455 | 0.324 | 1.000 | 0.619 | 0.232 | 0.237 | 0.241 | 0.198 | 0.000 | -0.375 |
| non_hiv_related_death_rate | 0.286 | 0.470 | 0.377 | -0.114 | 0.405 | 0.737 | 0.651 | -0.150 | 0.422 | 0.298 | 0.619 | 1.000 | 0.196 | 0.207 | 0.278 | 0.198 | 0.000 | -0.318 |
| percent_linked_to_care_within_3_months | 0.154 | -0.138 | -0.095 | 0.060 | -0.069 | 0.040 | -0.113 | -0.131 | -0.230 | -0.198 | 0.232 | 0.196 | 1.000 | 0.643 | -0.191 | 0.291 | 0.301 | -0.615 |
| percent_viral_suppression | 0.260 | -0.049 | -0.099 | 0.097 | 0.004 | 0.068 | 0.009 | -0.142 | -0.061 | -0.119 | 0.237 | 0.207 | 0.643 | 1.000 | -0.232 | 0.083 | 0.114 | -0.445 |
| plwdhi_prevalence | 0.208 | 0.481 | 0.562 | -0.243 | 0.370 | 0.373 | 0.535 | -0.163 | 0.411 | 0.483 | 0.241 | 0.278 | -0.191 | -0.232 | 1.000 | 0.029 | 0.070 | -0.006 |
| race | 0.159 | 0.000 | 0.014 | 0.000 | 0.034 | 0.024 | 0.013 | 0.212 | 0.036 | 0.017 | 0.198 | 0.198 | 0.291 | 0.083 | 0.029 | 1.000 | 0.000 | 0.210 |
| uhf | 0.000 | 0.015 | 0.000 | 0.857 | 0.049 | 0.027 | 0.040 | 0.000 | 0.050 | 0.000 | 0.000 | 0.000 | 0.301 | 0.114 | 0.070 | 0.000 | 1.000 | 0.000 |
| year | -0.141 | -0.170 | -0.146 | 0.000 | -0.161 | -0.126 | -0.111 | 0.166 | -0.200 | -0.180 | -0.375 | -0.318 | -0.615 | -0.445 | -0.006 | 0.210 | 0.000 | 1.000 |
| | year | borough | uhf | gender | age | race | hiv_diagnoses | hiv_diagnosis_rate | concurrent_diagnoses | percent_linked_to_care_within_3_months | aids_diagnoses | aids_diagnosis_rate | plwdhi_prevalence | percent_viral_suppression | deaths | death_rate | hiv_related_death_rate | non_hiv_related_death_rate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2011 | All | All | All | All | All | 3379.0 | 48.3 | 640.0 | 66.0 | 2366.0 | 33.8 | 1.1 | 71.0 | 2040 | 13.6 | 5.8 | 7.8 |
| 1 | 2011 | All | All | Male | All | All | 2595.0 | 79.1 | 480.0 | 66.0 | 1712.0 | 52.2 | 1.7 | 72.0 | 1423 | 13.4 | 5.7 | 7.7 |
| 2 | 2011 | All | All | Female | All | All | 733.0 | 21.1 | 153.0 | 66.0 | 622.0 | 17.6 | 0.6 | 68.0 | 605 | 14.0 | 6.0 | 8.0 |
| 3 | 2011 | All | All | Transgender | All | All | 51.0 | 99999.0 | 7.0 | 63.0 | 32.0 | 99999.0 | 99999.0 | 55.0 | 12 | 11.1 | 5.7 | 5.4 |
| 4 | 2011 | All | All | Female | 13 - 19 | All | 47.0 | 13.6 | 4.0 | 64.0 | 22.0 | 6.4 | 0.1 | 57.0 | 1 | 1.4 | 1.4 | 0.0 |
| 5 | 2011 | All | All | Female | 20 - 29 | All | 178.0 | 24.7 | 20.0 | 67.0 | 96.0 | 13.3 | 0.3 | 48.0 | 19 | 7.2 | 3.2 | 4.0 |
| 6 | 2011 | All | All | Female | 30 - 39 | All | 176.0 | 26.9 | 31.0 | 66.0 | 133.0 | 20.3 | 0.6 | 61.0 | 53 | 9.4 | 5.7 | 3.7 |
| 7 | 2011 | All | All | Female | 40 - 49 | All | 195.0 | 33.0 | 50.0 | 62.0 | 210.0 | 35.5 | 1.4 | 66.0 | 184 | 15.9 | 7.8 | 8.1 |
| 8 | 2011 | All | All | Female | 50 - 59 | All | 130.0 | 23.5 | 32.0 | 72.0 | 133.0 | 24.0 | 1.3 | 73.0 | 231 | 24.1 | 11.5 | 12.6 |
| 9 | 2011 | All | All | Female | 60+ | All | 57.0 | 6.7 | 23.0 | 68.0 | 60.0 | 7.1 | 0.3 | 81.0 | 129 | 33.5 | 10.6 | 22.9 |
| | year | borough | uhf | gender | age | race | hiv_diagnoses | hiv_diagnosis_rate | concurrent_diagnoses | percent_linked_to_care_within_3_months | aids_diagnoses | aids_diagnosis_rate | plwdhi_prevalence | percent_viral_suppression | deaths | death_rate | hiv_related_death_rate | non_hiv_related_death_rate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 31915 | 2021 | Staten Island | Willowbrook | Women | 50 - 59 | Black | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 2.5 | 1.0 | 0 | 0.0 | 0.0 | 0.0 |
| 31916 | 2021 | Staten Island | Willowbrook | Women | 50 - 59 | Latinx/Hispanic | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 0.8 | 1.0 | 1 | 0.0 | 0.0 | 0.0 |
| 31917 | 2021 | Staten Island | Willowbrook | Women | 50 - 59 | Other/Unknown | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 0.0 | NaN | 0 | NaN | NaN | NaN |
| 31918 | 2021 | Staten Island | Willowbrook | Women | 50 - 59 | White | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 0.2 | 1.0 | 0 | 0.0 | 0.0 | 0.0 |
| 31919 | 2021 | Staten Island | Willowbrook | Women | 60+ | All | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | NaN | 0.8 | 0 | 0.0 | 0.0 | 0.0 |
| 31920 | 2021 | Staten Island | Willowbrook | Women | 60+ | Asian/Pacific Islander | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 0.0 | NaN | 0 | NaN | NaN | NaN |
| 31921 | 2021 | Staten Island | Willowbrook | Women | 60+ | Black | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | NaN | 1.0 | 0 | 0.0 | 0.0 | 0.0 |
| 31922 | 2021 | Staten Island | Willowbrook | Women | 60+ | Latinx/Hispanic | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 0.7 | 0.5 | 0 | 0.0 | 0.0 | 0.0 |
| 31923 | 2021 | Staten Island | Willowbrook | Women | 60+ | Other/Unknown | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 0.0 | NaN | 0 | NaN | NaN | NaN |
| 31924 | 2021 | Staten Island | Willowbrook | Women | 60+ | White | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 0.1 | 1.0 | 0 | 0.0 | 0.0 | 0.0 |